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In this paper, we study the nonparametric maximum likeUhood estimator (MLE) of a convex 
hazard function. We show that the MLE is consistent and converges at a local rate of n^^^ at 
points xq where the true hazard function is positive and strictly convex. Moreover, we establish 
the pointwise asymptotic distribution theory of our estimator under these same assumptions. 
One notable feature of the nonparametric MLE studied here is that no arbitrary choice of tuning 
parameter (or complicated data-adaptive selection of the tuning parameter) is required. 
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1. Introduction 

Information on the behavior of time to a random event is of much interest in many fields. 
The random event could be failure of a material or machine, death, an earthquake or 
infection by a disease, to name but a few examples. Frequently, this type of data is called 
lifetime data, and it is natural to assume that it takes values in [0,oo). If the lifetime 
distribution F has a density /, then a key quantity of interest is the hazard (or failure) 
rate h{t) = f{t)/{l — F{t)). Heuristically, h{t)dt is the probability that, given survival 
until time t, the event will occur in the next duration of length dt. The hazard function 
is also known as the force of mortality in actuarial science or the intensity function in 
extreme value theory. 

Certain shape restrictions arise quite naturally for hazard rates. In this work, we are 
particularly interested in the family of hazard functions which are convex. That is, we at- 
tach the additional smoothness constraint of convexity to the more traditional assumption 
of a bathtub-shaped failure rate (that is, first decreasing, then increasing). Heuristically, 
bathtub-shaped hazards correspond to lifetime distributions with high initial hazard (or 
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infant mortality), lower and often rather constant hazard during the middle of life and 
then increasing hazard of failure (or wear-out) as aging proceeds; see [20, 23]. 

Many other estimators of hazard functions (and solutions to the closely related prob- 
lem of estimating the intensity of a Poisson process) with and without shape restrictions 
have been considered in the literature; see [24] for a partial review up to 2002. In recent 
years, the focus has shifted to construction of "adaptive" estimators over large scales of 
smoothness classes; see, for example, [5, 6, 25]. Virtually all of these other estimators 
require careful choice of penalty terms or tuning parameters, and computation of the 
adaptive estimators typically involves methods of combinatorial optimization. As far as 
we know, reliable algorithms for computing them are not yet available. Our estimators 
avoid the choices of tuning parameters or penalty terms by virtue of the shape constraint 
of convexity and are relatively straightforward to compute since the corresponding opti- 
mization problems are convex. 

Recall the definition of a convex function. Let C C M+ = [0, oo) be convex. Then ft, : C i— >• 
M is convex (on C) if it satisfies 

h{\x + (1 - \)y) < \h{x) + (1 - X)h{y), < A < 1, 

for all x,y (Iz C. Equivalcntly, a function is convex if its epigraph 

{{x, n):xeC,fieR,fi> f{x)} 

is a convex set in (see, for example, [27], Section 4). Thus, a convex function on C 
may be extended to a convex function on IR+ by setting h{x) = +oo for x G R+ n C^. 

Suppose, then, that we observe i.i.d. variables Xi,. . . ,X„ from a distribution Fq with 
density /o and hazard rate ho. We denote the true cumulative hazard function by Ho{t) = 
Jq ho{s) ds, and the true survival function by 5o = 1 — -Fq. Also, < < JC(2) < • • • < 
denote the order statistics corresponding to Xi,. . . ,X„. 

To define the MLE of Hq, /i„, we first consider the likelihood in terms of the hazard, 

n n 

C{h) = J] h{X,) exp{-i/(X,)} = n KX(^))cM~H{X(e,)}. 

1=1 i=l 

where Hit) = /J/i(s)ds. This can be made arbitrarily large by increasing the value of 
/i(X(„)). We therefore find /i„ : [0,X(„')) i~> R+ by maximizing the modified likelihood 

n-l 

= n h{X(^))cM-H{X(e>)} X exp{-if(X(„))} (1.1) 

i=l 

over /C+, the space of non-negative convex functions on [0,X(„-)). The full MLE is then 
found by setting hn{x) = oo for all x > X(„-). This is the same approach as taken in [10]. 
Equivalently, one could first impose the constraint that h < M, and then let M — > oo 
(see, for example, [26], page 338). 
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Figure 1. Examples of the estimator. Left: Estimating the HS hazard with fe = and A=l for 
a sample size of 100 (bold = true hazard, solid = MLE). Right: Estimation of the earthquake 
hazard from CPTI04 data (solid = MLE). 



To illustrate the proposed estimator, eonsider the distribution with density given by 

fit) = ^ + '^b ^ on < i < A. 

2A^b-^ + {l + 2h)t/A ~ ~ 

This distribution was derived in [14] as a relatively simple model with bathtub-shaped 
hazards which also has an adequate ability to model lifetime behavior. We will call 
this the HS distribution^ after the authors. It has convex hazards for all values of h in 
the parameter space (6 > — 1/2). Figure 1 shows the MLE for simulated data from this 
distribution with sample size n = 100. 

We also applied our estimators to the earthquake data of the Appennino Ahruzzese 
region of Italy (Region 923) recently considered by [19], where Bayesian estimation meth- 
ods are studied. The data comes from the Gruppo di Lavoro CPTI (2004) catalog [9]. 
It consists of 46 inter-quake times for Region 923, occurring after the year 1650 and 
with moment magnitude greater than 5.1 (details on the justification of these criteria is 
available in [19], page 14). Figure 1 shows the resulting estimator. 

The main results of this paper are the characterizations, consistency and asymptotic 
behavior of the nonparametric MLE of a convex hazard function. The estimator is contin- 
uous and piecewise linear on [0,X(„-)). Although we give a characterization of the MLE, 
the final form of the estimator is not explicit. We therefore propose an algorithm (based 
on the support reduction algorithm of [13]). This algorithm is discussed in a separate 
report, [16], and is available as the R package convexHaz [17]. 

To describe the local asymptotics of the MLE, we introduce the following process. 

Definition 1.1. Let W{s) denote a standard two-sided Brownian motion, with W{Q) = 
0, and define Y{t) = W{s)ds + t^. The function {T{t):t € M}, the invelope of the 
process {Y{t) : t e K}, is defined as follows: 



the function I is above the function Y :X(t) > Y{t) for all t G K; (1-2) 
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the function X has a convex second derivative; (1-3) 
the function I satisfies Jj^ilit) - Y{t)} cll^^) (t)=0. (1.4) 

It was shown in [11] that the process I exists and is almost surely uniquely defined. 
Moreover, with probabihty one, I is three times differentiable at t = 0. The asymptotic 
behavior of all of our estimators may be described in terms of the derivatives of the 
invelope X at zero. The following theorem builds on the basic results in [12] concerning 
nonparametric estimation of a decreasing convex density. 

Theorem 1.2. Suppose that Hq is convex, xq > is a point which satisfies ho^xo) > 
and ft,Q(x'o) > 0, and /lo(-) is continuous in a neighborhood of xq. Then 

fny^iK{xo)-hoixo)) \ ^ /cil(2)(o)\ 

W/Hh'^{xo)-h',{xo))J '{c,i(^Ho)J' 

where l'^^\0) and I'-^^O) are the second and third derivatives at 0, respectively, of the 
invelope of Y{t) = W{s) ds + t'^ and where 

^ ( hl{x^)h'i(x^) \ ^ ( h^{x^)h'i{x^f \ 

2^si{xo) ; ' { 243^0(^0) J ■ 

The key to this result lies in Lemma 5.3, where we establish that the "touchpoints" 
(defined carefully in Section 2) cluster around xq at a local scale of The assumption 

that Hq is strictly convex and continuous near xq is crucial in this step. If /iq(xo) = 
(and is continuous in a neighborhood of a;o), we conjecture that hn{xo) converges at the 
rate n^/^. Similar behavior has been noted for monotone density estimators in [8]. If /ig is 
discontinuous at Xq, unpublished work of Cai and Low [7] suggests that hn{xo) converges 
to ha{xa) at rate ji^/^^ xhe behavior of convex-constrained estimators in both of these 
situations remains unknown and is the subject of current research. 

The limiting distributions of hn{xo) and h'„{xo) involve the constants ci and C2, 
which depend on the (unknown) hazard function Hq, as well as the random variables 
(l(^)(0),l'3)(o)) which have a universal distribution free of the parameters of the prob- 
lem. Thus, Theorem 1.2 can be used, in principle, to form confidence intervals for 
ho{xo) and /ig(a;o). This would involve estimation of the constants ci = ci{ho,Xo) and 
C2 = C2{ho,xo), respectively, both of which depend on hQ^xa), and appropriate quantiles 
of the distributions of I^^^ (0) and X^^) (g) , respectively. Although virtually nothing is 
known about the distribution of the invelope and its derivatives analytically, the algo- 
rithms developed in [11] can easily be used to obtain simulated values of the needed 
quantiles. Other possible approaches to confidence intervals in this problem involve in- 
version of likelihood ratio tests (see [2-4] for this approach in the context of monotone 
or [/-shaped function estimation) or resampling methods as in [22] and as discussed in 
[4] in the setting of nonparametric estimation of monotone functions. It should be noted 
that our Theorem 1.2 verifies one of the key hypotheses needed for validity of the gen- 
eral subsampling theory of [21, 22], and therefore makes the subsampling approach to 
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confidence intervals viable. The details and properties of all these approaches remain to 
be investigated. 

The outline of this paper is as follows. Section 2 is dedicated to the proof of character- 
izations, existence and uniqueness of the MLE. Consistency is proved in Sections 3 and 4 
establishes lower bounds for the pointwise minimax risk of Rates of convergence are 
established in Section 5, with Section 5.3 containing proofs of our main results concern- 
ing the limiting distribution at a fixed point. The companion technical report [15] also 
includes a detailed treatment of a least-squares estimator, as well as sketches of similar 
results for censored data and intensity functions of Poisson processes. 

2. Characterizations, uniqueness and existence 

Proposition 2.1. The function hn which minimizes ipn over /C_|_ is piecewise linear. It 
has at most one change of slope between observations, except perhaps in one such interval, 
where, if the estimator touches zero, it may have two changes of slope ( it is zero between 
these two changes). Also, between zero and ^(i), the minimizer may have at most one 
change of slope, but this happens only if it touches zero, and in this case the estimator is 
increasing and equal to zero before the first change of slope. Between X(„_j) and 
the minimizer will also have at most one change of slope, and this only in the case where 
it is decreasing on [^(,i_i),^(„)) and equal to zero after the change. 

Proof. Consider any h and choose a convex g such that h{Xi) ~ g{Xi) for i = 1, . . . , n — 1 
and h{x) > g{x) > on [0, It follows that (pn{h) — fnid) > if and only if H{Xi) > 

G{Xi) for i — 1, . . . ,n. Hence, the smaller we make g on [0, ^(n)), the smaller (fnig) will 
become. It is not difficult to see that the smallest such g, with values of g{Xi) fixed, must 
have the prescribed form. □ 

Since hn is piecewise linear, it may be expressed as 

k m 

hn{t)=Z + J2 % (^J - ^)+ + H - )+ , (2.1) 

where Vj,a,Jlj > 0. We let Tj denote the points of change of slope of hn where hn is 
decreasing and let rjj > denote the points of change of slope where h„ is increasing. For 
simplicity, we assume that these are ordered. Also, we have < 771. As seen in the next 
lemma, the r^'s and ?7j's correspond to "points of touch" or equality of processes defined 
on the one hand in terms of hn and the data, and on the other hand just in terms of 
the data. We therefore also refer to them as "touchpoints" repeatedly in the remainder 
of the paper. 

It is convenient to define the MLE in terms of the minimization of the criterion function 

/>oo 

Mh)^ / {H{t)~\ogh{t)lt^x,Jd¥n{,t), 
Jo 
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where F„ denotes the empirical distribution function of the data, 

1 " 

n ^ — ' ^ ' 

i=l 

We will also use the notation §„(t) = 1 — F„(t) for the empirical survival function. 



Lemma 2.2. Let F„(t) = ^lo,t]{X(i)) . A function hn minimizes (pn over IC+ 

(and hence is the MLE) if and only if: 

x-t 



dF„(t)= / / Snis)dsdt (2.2) 

hn{t) Jo Jo 

for all X > Owith equality at Ti for i = 1, . . . , fc; 

t-x ~ /-oo ^oo 

dF„(t)= / / §„(s)dsdi (2.3) 

X hn{t) Jx Jt 

for all X > Owith equality at rjj for j = 1, . . . , m; 
, J— dF„(t) < / Sn{t)dt, (2.4) 

lo hn{t) Jo 

) 

i?„(<)dF„(t) = l-l/n. (2.5) 

Moreover, the minimizer hn satisfies 

hn{t)Sn{t)dt^¥n{x) (2.6) 

Jo 

for Xe {Tl,...,Tk,T]l,...,r]m}- 

Remark 2.3. As wc assume a priori that ft,„(X(-„)) =oo, we may rewrite the left-hand 
side terms in (2.2)-(2.4) via 

^dF„(t)= / ^l,^^^„,dF„(t)= / f--^dF„(t), 

A hn{t) J A hn{t) J A hn{t) 

I A hn{t) J A hn{t) J A hn{t) 

for any set A. We will hereafter use this latter formulation. 

Corollary 2.4. Let {Ti]^^i and {r]j}"Li denote the change points of hn as in (2.1). It 
follows that 

^ dF„(t)= f^Sn{u)du, (2.7) 



hn{t) 
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dF„(t)= / §„(u)du (2.8) 



for i = 1, . . . ,k and j = 1, . . . ,m. 
Proof. The function 

(t>{x)= r z^d¥„{t)~ r r§„(s)dsdt 

Jo hn(t) Jo Jq 

is maximized at for i = 1, . . . , fc. Since it is also differentiablc, (2.7) follows. A similar 
argument proves (2.8). □ 

Proof of Lemma 2.2. Consider any non-negative convex function h. It follows that 
there exists a non-negative constant a and non-negative measures v and /i (these measures 
have supports with intersection containing at most one point) such that 



h{t) = a+ / {x -t)+diy{x) + / {t - x) + dfi{x) . 
Jo Jo 

For any function h in /C_|_ , we calculate 

Mh) - ^h) > l^[H{t) - Hit) + (^1 - III) l,^x,„, I dF„(i) 

since — log 2; > 1 — a;. Plugging in the explicit form of h from above, we find that the 
right-hand side is equal to 

\ / §„(s)dsdt- / ^lt^x,„,dF„(i) Idi^(x) 
Uo Jo Jo h{t) J 

/ / S„(s)d.sdt- / ■.^U^x^„,d¥n{t)\d^l{x). 
Ux Jt Jo h{t) ) 

This is non-negative if /i is a function which satisfies conditions (2.2)-(2.5). It follows 
that these conditions are sufficient to describe a minimizer of ipn ■ 

We next show that these conditions arc necessary. To do this, we first define the 
directional derivative 

Lp„{h + ej)- Lp„{h) _ f°°(-^,,.. 7(0 



d^^nih) EE lim ' = |r(t) - ^^1mx,„, \ dF„(0. (2.9) 
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If hn minimizes ipn, then for any 7 such that /i„ + £7 is in /C-|- for sufficiently small e, 
we must have 9^iy9„(/i„) > 0. If, however, h„ ± ej is in /C+ for sufficiently small e, then 
d-f(pnihn) =0. 

If we choose, respectively, j{t) = 1, (t — (y — then /i„ + £7 is in /C+ and we 
obtain the inequalities in conditions (2.2)-(2.4). Since (1 ±e)/i„ is also in IC+, for suffi- 
ciently small e, we obtain (2.5). Choosing 7 = (r^ — t)+, (t — ?7j)+ yields the equalities in 
(2.2) and (2.3), respectively, since for each of these functions, hn ±£7 is in JC+. 

Lastly, we prove (2.6). For any r,;, define 

7(t) = "■'^"(■^0) forte[0,rj], 

\ 0, otherwise. 

Since (1 ± 5)7 is also in /C+, it follows that (9^(p„(/i„) = and hence 

' Hn {t) dWn {t) - F„ in ) + Hn (t, )§« (t, ) 



+ hn{n)\ f .J—d¥n{t)-f td¥n{t)-nSn{T,) 
Uo hn[t) Jo 

Integration by parts and Corollary 2.4 yield (2.6) for x = r,;. The case where x = rjj is 
obtained in a similar manner, but using ^(t) — {hn{t) — /i„(?7j))l(^^_oo)(i) a-nd (2.5). □ 

The next corollary allows us to extend the equalities of the characterization of the 
MLE to some extra touchpoints. The significance of these equations will become clear in 
Section 5, where we consider asymptotics of the estimator. 

Corollary 2.5. Suppose that hn is strictly positive and recall the formulation given in 
(2.1). Then we also have that 

dF„(i)= / / §„(u)duds, (2.10) 



hn(t) Jq Jq 

t-Tk 



00 /'OO 



Tfc hn{t) JTk 
1 



hn{t) 
1 



d¥n{t)^ J J Snis)dsdt, (2.11) 

d¥n{t) = f '§„(u)du, (2.12) 



d¥n{t) = / Sn{u)du. (2.13) 



Vj hn {t) J ijj 

Proof. The first two equalities follow by noting that if hn is strictly positive, then for e 
sufficiently small, /i„ ±£7 is in K.^ for ^(t) = [t — Tfc)+, (ryi — t)j^. Arguing as for Corollary 
2.4 proves the remaining identities. □ 
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Proposition 2.6. There exists a unique minimizer hn of tpn over /C+. 

Proof. Wc will show that a minimizer exists by reducing the search to bounded positive 
convex functions on a compact domain. As this is a compact set, under the topology of 
uniform convergence, a minimizer of ipn exists (see [27], Theorems 10.6, 10.8 and 27.3). 

Wc must first handle the issue of a compact domain. As we assume a priori that 
/i„(X(„-)) = oo, we are really looking for the minimizer of the modified negative of the 
log-likelihood with domain [0,X(„)). However, we have also argued that the minimizer 
must have the specific functional form as described in Proposition 2.1. Therefore, it is 
sufficient to reduce the domain to [0, A"(„_i) +(5], for any (5 > 0, since h„ is then extended 
linearly beyond + (5 in a unique manner. It will therefore be sufHcient to show 

that we may reduce the search to functions bounded on [0,X(„_i)], with a derivative at 
which is bounded above. 

Recall that the minimizer must satisfy (2.5). Wc therefore reduce our search to the 
class of functions which satisfy this condition. For any such h, write h = + h^, where 
h-i- is increasing and /i_ is decreasing. It follows that for any x, 

1> / H{t)d¥n{t)^ / h{t)Sn{t)dt>h_{x) / S„(i)di. 
Jo Jo Jo 

A similar bound for yields 

h{x)<(^J^ Snit)d?j §„(i)di^ =M„(x) (2.14) 

for all X in (0,X(„)). Thus we know that h{x) must be bounded for x £ {Q,X(n-i)]- 

To show that h is also bounded at zero, we need to show that /i'(X(i)) is bounded 
from below. Assuming that it is negative, we may write, for < x < -'^^(i), 

hiX(^,)) + h'iX^,)){x - X(i)) = hix) < M^x). 

Fixing X* > Q and less than wc then obtain that 

/^'(^(l)) > {Mn{x*) - h{X(^)))l{x* - X(i)), 

from which it follows that h must be bounded on the set [0,X(„_i)]. 
By (2.5), we also have that 

n>i?(X(„))> / h{t)dt= I ""^ {/i(X(„_i)) + /^'(X(„_i))(i-X(„_i))}dt 

if h is increasing on [X(„_i),X(„)). This implies that /i'(X(„_i)) is bounded above, com- 
pleting the proof. 

We now show uniqueness. Suppose that hi and /12 both minimize ipn- Then, by (2.5), 
Lpnihi) and ipn{h2) differ only in the term — \oghi{t)\{t ^ ^(n)) dF„(t). However, this 
term is strictly convex and it follows that hi{X^{^) = /i2(^(i)) for all i = 1, . . . ,n — 1. 
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Let h = {hi + h2)/2. By linearity, we have that ipn{hi) = ip{h2) = fih), which implies 
that h is also a minimizer. However, the only way that this is possible is if h also satisfies 
the conditions of Proposition 2.1. This implies that one of the following holds: 

1. Both hi and ft,2 are increasing and /ii(0) = /i2(0) = 0. In this case, they must have 
the same locations for their changes of slope, as otherwise h violates Proposition 2.1. 

2. Point 1 above does not hold. Then, by the same argument as above, if hi and /12 
have at least one change of slope in an interval between observations (or between 
zero and ^(1)), then these locations of change of slope must be equal. 

If the first case holds, then it is not difficult to see that hi = h2 on [0,X(„)), as 
hi{t) — hi(t) = on [0,Ti] and hi{Xi) ~ h2{Xi) for all observation points. 

In the second case, we use a different argument. We know that neither hi nor /12 have 
touchpoints before X(iy Let t* denote the first touchpoint of hi and (without loss of 
generality) assume that the first touchpoint of /12 is greater than t* . Hence, by (2.6), 

hi{X^i))^h2{X^i)), I hi{t)dt = ¥^{t*). 

Jo 

Now, h = {hi +/i2)/2 and /12 are also minimizers of the MLE criterion function (^„. Also, 
h has a touchpoint at t* and /i(X(i)) = /i2(X(i)). 

Averaging h with /12 yields the functions hi — 2^'(/ii — /i2) + /i2, which satisfy 

/i/(X(i)) = /i2(^(i)), / hi{t)dt = ¥^{t*) foralU>l. 

JO 

Since hi — > /12 pointwise, it follows from the dominated convergence theorem that 
/p h2{t) dt ~ Wn{t*). Therefore, since hi and /12 are both linear on [0,^*] with hi{X(^i^) = 
^2(^(1)) and hi{t)dt = /i2(i)dt, they must have both the same value and slope 
at That is, both /ii(X(i)) = /i2(^(2)) and /ii(-'^(i)) = /i2(-^(2)) hold. 

Now, write 

Tn 1 — 1 

hi{t) ^ ai + bit + ^ lyi^l{t-t^^l)+, 

i=\ 
m2 — l 

h2{t) = a2 + b2t + ^ Vi^2{t-ti^2)+, 
i=\ 

where X{X) < iij < hj < ■ ■ ■ < tni,~i,j < X(n), j = 1,2, and where hi{X(i)) = h2{X(i)) 
for i = 1, . . . ,n. We also assume that > for i = 1, . . . , nij — 1, j = 1, 2. This implies, 
in particular, that hj{t) = Oj + bjt for t < tij, j = 1,2, and since ^(1) < tij, j = 1,2, 
hi{X(^i'f) = h2{X(^i-j). Thus oi =02 + 62-'^(i) ■ From the argument above, it follows 

that bi = h[{X(^i'f) = h2{X(i-f) = 62. We conclude that ai = 02 and bi = 62 so that hi{t) = 
h2{t) for < t < t*. It also follows that ti^i = ti^2- 
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Repeating this argument on the interval with t** = min{i:2,i,i2,2} shows that 

vi^i = vi.2 or t2.i = t2.2- Proceeding by induction yields Vj,i = Vj_2 and ~ tj+1,2 for 

j = 1,. . mi ~ 1 — 1712 — 1, hence uniqueness. □ 



3. Consistency 

Theorem 3.1. Suppose that Xi, . . . ^ Xn are i.i.d. random variables with convex hazard 
function ho and corresponding distribution function Fq. Let Tq = To{Fo) = mf{t : Fo{t) = 
1} . The MLE hn{t) is then consistent for all t £ {0,Tq). Also, for all 5 > 0, 

sup \hnit) — h{t) \ ^ 0, almost surely, 

5<t<To-S 

if Tq < 00. //To = 00, the above statement holds with Tq — S replaced by any K < 00. 

Remark 3.2. If /iq is increasing at 0, then one can show that /i„ is not consistent at 
zero. This is a frequently occurring difficulty of shape constrained estimators; see, for 
example, [1, 12, 30]. 

Proof. We first show that /i„ is bounded appropriately so that we can select convergent 
subsequences. Decompose /i„ into its decreasing and increasing parts: /i„ = /i„4 + hn^^. 
Then, arguing as in (2.14), it follows from (2.5) that 

hn,i{x) < ^ , (3.1) 

where the right-hand side is almost surely bounded and, in fact, converges almost surely 
to 1/ /g^ S'o(i) dt<oo for all x>0. Also, 

where the right-hand side is almost surely bounded for x e (supp(i^o))° a-nd converges 
almost surely to 1/ So{t)dt < 00. 

Now, take 7 = ft-o in the directional derivative (2.9). It follows that 

e Jo I /i„(i) J 

noting that /i„(X(„)) ~ 00, and hence 

ho{t) 



h„{t) 



dF„ {t)< Ho{t) dF„ (t)^ Ho{t)dFo{t) = l. 
Jo ^■^■Jo 
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Fix any < a < b < oo such that a, & £ (supp(Fo))°. It follows that lini„ X(„) > b with 
probability one (this can be shown using the Borel-Cantelli theorem). Also, sup |F„(t) — 
Fo{t) \ — >a.s. by the Glivenko-Cantelli lemma. Both of these events occur on the set J7, 
with P(r2) = 1. Fix Lu Gil. We will show that /i„ for such an uj. 

Let {n'} denote any subsequence of {n}. By the bounds in (3.1) and (3.2) (which are 
finite for our choice oi uj), using a classical diagonalization argument and the continuity 
of convex functions, we may extract a further subsequence {n"} such that hn" — > h 
pointwisc on [a, 6], where the limit h must be convex. We denote this subsequence {n} 
to simplify notation. 

From Fatou's lemma, it follows that 

' ''"(^^ Soit) dt^ fb^ m dt < hminf /' ^ dF„(t) 



(t) J a hn{t) " J a /l„(t) 

< lim sup / dWn {t)< hm / Ho{t) dF„ (i ) < 1 . 

n Jo hn{t) " Jo 

Note that this implies that if h{t) = 0, then ho{t) = 0. By (2.5) and integration by parts, 
we see that 1 > Jj^ ft,„(t)S„(t) di. Therefore, again applying Fatou's lemma, 

1> f h{t)So{t)dt. 

J a 

It also follows that 

Ja h{t) 



h{t)So{t)dt^2 I hoit)Soit)dt+ I lj^Soit)dt 



6_ ,b 

a h{t) 

< 2-2 / ha{t)SQ{t)dt. 

J a 

Define h ~ ho for t ^ [o-,b], which allows us to let both 1/a and 6 — > oo in the above 
display. Since ho{t)So{t) dt = l, it follows that 



h{t) 

and this implies that h{t) = ho{t) for all t G [a,b]. 

Wc have thus shown that every subsequence {hn{x)} has a further subsequence which 
converges to the true hazard function holx) pointwisc, for all x€ (supp-Fb)°- It follows 
that {hn} converges to ho pointwise. By Theorem 10.8, page 90, [27], this implies that 
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the claimed uniform convergence on [a, h] also holds. As this happens for any cj G fi, and 
P(ri) = 1, we have proven the result. □ 

Corollary 3.3. Suppose that h'^ is continuous and strictly positive at xq G (supping)"- 
It follows that there exist touchpoints Tn < Xq < rjn such that t„ , rjn — > Xq in probability. 

Proof. Let 7?„,t„ be touchpoints such that r„ < xq <rjn. If t„ does not exist, then set 
r„ = 0, and i]n = oo otherwise. Suppose that it is not the case that Tn,r]n — >p xq. It then 
follows from Theorem 3.1 that there exists an interval / = [a,b] such that xq G I for 
|/| > 0, limsup„r„ < a and liminf„77„ > b almost surely, and, lastly, lim/i„(i) — >a.s. ^o(^) 
on /. However, this implies that /io(i) is linear on /, which is a contradiction. □ 

From consistency of the estimator, we also obtain consistency of the derivatives. 

Corollary 3.4. Suppose that x G {a,b) and sup£j<j<t, |/i„(t) — ^o(i)| ^a.s. 0. Then 
h'^{x) -^a.s. hQ{x) at all continuity points x of Hq on (a, 6). 

This follows from the following simple result for convex functions, proved in [12]. 

Lemma 3.5. Suppose that hn is a sequence of convex functions satisfying sup^<^<j \hn{t) - 
/io(t)| — >■ with probability one. Then (also with probability one), for all x G (a, 6), 



-oo < /ig(a; ) < lim inf /i^j (x ) < limsup/i^(a;"'") < /iQ(a;''") < oo 

n— >oo 



4. Asymptotic lower bounds for the minimax risk 

Define the class of densities C by 

C = (/:[0,c^)^[0,c»): / /(a;)da: = l, 



L Jo 

h{x) — f{x)/{l — F{x)) is convex, h{x) > for all a; > 

We want to derive asymptotic lower bounds for the local minimax risks for estimating 
the convex hazard function h and its derivative at a fixed point. The ii-minimax risk 
for estimating a functional T of /o based on a sample Xi , . . . , Xn of size n from /o which 
is known to be in a subset C„ of C is defined by 

MMiZi(n,r,C„)=inf sup Ef\Tn-Tf\, (4.1) 
^" fec„ 



MLE & LSE of a convex bathtub hazard 1023 

where the mfimum ranges over all possible measurable functions r„ = tn{Xi, . . . , Xn) 
mapping M" to M. The shrinking classes C„ used here are HcUinger balls centered at fo, 

Consider estimation of 

7^i(/) = T^^ = M^o), m)^h'{x,). (4.2) 
1 - F(.To) 

Let fo € C and xq > he fixed such that Hq is twice continuously differentiable at xq ■ 
Define, for e > 0, the functions as follows: 

{ho{xf) — ece) + {z — xo+ £Ce)/io(a;o — ece), z g[xo — eCe,xo — e], 
ha{xa + e) + {z - xq - e)/io(a;o +e), z £ [xq - e,xo + e], 

hf){z), otherwise. 

Here, is chosen so that is continuous at xq ^ e. Using continuity of /i^ and a second 
order expansion of Hq, it follows that = 3 + o(l) as £ ^ 0. Now, define /e by 

fe{z) = cxp{-H^{z))h^{z), 

where H^{z) = h^{u) du. It follows easily that 

Tiife) - Ti(/o) - ^h'^{xo)e' + o(£2), (4.3) 
T2{fe)~T2ifo) = h'^ixo)e + oie). (4.4) 
Furthermore, the following lemma holds. 

Lemma 4.1. Under the above assumptions, 

^2 ^ 2 h'^ixony Fixo)) ^, ^ ^ ^ , 

5 ho(xo) 

Proof. The lemma follows from Lemma 2 of [18] and 

{Mx)-fo{x))^ ^ 16 h'^{xoril~F{xo)) ^, ^ ^^^5^^ 
foix) 5 ho{xo) 

This is achieved by careful calculation. □ 
Combining (4.3) and (4.4) with the lemma, it follows that 

- T.(/o)l > (^^2^;g2£) '".'"(l + 0(1)). 
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From these calculations, together with Lemma 5.1 of [12], we have the following result. 
Along with Theorem 1.2, it indicates that hn{xo) and h'^{xo) achieve optimal rates and 
also have the correct dependence on the parameters h"{xo) and h{xo) (up to absolute 
constants). 

Theorem 4.2 (Minimax risk lov^rer bound). For the junctionals Ti andT2 as defined 
in (4-2), and with MMRi{n,T,Cn.T) as defined in (4-1), 

suphmsupTi ' MM/ci(n, Ji,C„,r) > - — _ and 

T>o n-i-oo ' 4 \ S'o(a;o)c8v2 / 

suplimsupnV5MMi?i(n,T„C„,.) > ^"^fjf ^ V^'. 

T>0 n^oo A\Ac 2So[Xo) ) 

In particular. Theorem 1.2 shows that the MLE achieves the optimal pointwisc rate of 
convergence, n^/^, at points xq with h"{xo) > 0. Convergence rates over the larger class 
of bathtub-shaped functions would be slower: the MLE of a [/-shaped hazard is known 
to converge locally at rate n^/'^; see, for example, [2]. 

5. Rates of convergence 

In this section, we identify the local rates of convergence of the MLE. Fix a point xq € 
(supp_Fo)°- To obtain the results, we assume that /lo(') continuous and strictly positive 
in a neighborhood of xq and that h(xo) > 0. 

5.1. Some useful estimates 

For <x <y, define 

Jx (. hn{Z) ) 

Lemma 5.1. Let xq G (suppi^o)°- Then, for each e > 0, there exist constants 5,co,no 
and (positive) random variables Mn (independent of x,y), of order Op(l), such that for 
each I a; — .To I <5, 

\Un{x,y)\<e{y-xf +n-^'HU, 0<y-a;<co, (5.1) 

for all n>nQ. 

Proof. Note that L/„ = (P„ - Po)i9^^y^JiJ, where 

/ \ fx,y{z) 

gx,y,h{z)= \x,v]{z) 
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and, in view of the consistency established in Theorem 3.1, /i„ is a convex function 
uniformly close to kg on neighborhoods of xq ■ This leads to the consideration of the class 
of functions 

^ J z gx,yji{z) ■.x<y<x + R,h convex, 

with 7 = m{^„_s<x<xo+5+co ho{x)/2 and we define Gn = {\\hn - hoW^tl'^''" < 7}. The 
class Ta:,R has an envelope function F^^uiz) =J~^{{z - x)l[^^a:+B]{z) +2^^Ri[x,x+R]{z}} 
and hence the following second moment bound holds: 

E{[F,,Hn [{z ~x) + R/2]^h{z) dz < ^\\f,r^+J^R\ 

7 J[x,x+R] -1-^7 

Furthermore, \ogN\^^{e,J-x.R,L2{PQ)) I^K/e^^"^ for some constant K by [29], Theo- 
rem 2.7.10, page 159, and a straightforward bracketing argument. It then follows from 
[29], Theorems 2.14.2 and 2.14.5, pages 240 and 244, that 

£;{( sup |(P„-Po)(./)|)'}<^A"i?{[F,,fl(Xi)]2} = 0(n-ii?3). (5.2) 

Define M„(u;) as the infimum (possibly +00) of those values such that (5.1) holds and 
define Ain.j) to be the set [{j — l)n^^/^, jn^^/^). Then, for m constant, 

P(A/„ > m) < P([Af„ > to] n G„) + P{Gl) 

< P{[3u:\Unix,x + u)\ > eu^ + 7i-*/^7i] n G„) + P(G,"J 

< P{[3u e A{n,j) : n4/5|C/„(a;, x + u)\ > e{j - 1)^ + to] n G„) + P(G,=J. 

The jth summand is hence bounded by 



sup \Un{x,x + u)m\^lG^/[m + e{j -l)'^]^ <C 



3 



l-ueA(«j) ' "J [TO + e(j-l)4]2 

due to (5.2). Thus it follows, using Theorem 3.1 to conclude that P(G^) — > 0, that 



limsupP(Af„ > to) < G^ y 

n— J-oo . - 

.7^1 



TO + e(j-l)4]2' 

where the sum in the bound is finite and converges to zero as to ^ 00. This completes 
the proof of the claim. □ 

A similar approach proves the following for the function 

Vn{x,y) = Ciz -{x + j/)/2}(§„(z) - 5*0(2)) dz. 
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Lemma 5.2. Let xq £ (supp^o)°- Then, for each e > 0, there exist constants i5, cq > 
and (positive) random variables Af„ (independent of x,y) of order Op{l) such that for 
each \x — xo\ < S, 

\Vn{x,y)\<en-^/^{y-x)^ + n-^Mn, 0<y-x<co. (5.3) 



5.2. Asymptotic behavior of touchpoints and resulting bounds 

Lemma 5.3. Let Xq > be a point at which ho has a continuous and strictly positive 
second derivative, and where h{x()) > 0. Let ^„ be any sequence of numbers converging 
to Xq and define Tn and rjn to be the largest touchpoint of h„ smaller than ^„ and the 
smallest touchpoint larger than respectively. Then 

??« -T„ =0p(n-i/5). 

Proof. By Theorem 3.1, we know that h„ is positive near xq for large enough n. Also, it 
is either strictly increasing or strictly decreasing in a neighborhood of xq, or it is locally 
flat. If hn is decreasing between r„ and then (2.7) and (2.2) with equality at both r]n 
and r„ hold. If, instead, /i„ is increasing, then (2.8) and (2.3) with equality at both i]n 
and Tn hold. There is only the potential for a problem in the locally flat case. However, 
since is strictly positive, by Corollary 2.5, we can extend the necessary equalities to 
this case as well. Therefore, we need only consider two cases, /i„ is either non-increasing 
or non-decreasing on [r„ , rjn] . 

We first assume that /i„ is non- increasing on [r„,77„]. Define 

^n,dz)^ f ^-^d¥„{t) and A„,;(z) = / S„(t)dt, (5.4) 

Jo hn{t) Jo 

and let m„ be the midpoint of [t„,77„], m„ ~ (t„ -|- 7y„)/2. We may then calculate 



Un.iimn) = / " dWnix) + 'Hn,l{r/n) ~ {Vn ~ "ln)'H^._|_(r?„) 

■ dF„(a;) -I- ■H„,4,(r„) -I- (m„ - t„)'H^_j_(t„). 



hn{x) 



From (2.2), we know that 2'H„.4,(m„) < 2 J^" An,i{t)dt. The equality in (2.2), together 
with (2.7), allows us to rewrite this as > Li,^ + ^2,^, where Li^^ is equal to 

^^dF„(x) + ra^dFnix) !^{K,,(r;„) -K,,(r„)} 

- x - (l/2)(,„ mn) ^ (l/2)(r.+^.)-x 



hn{x) Jt„ hn{x) 
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i(7]„ +m„)ls„(x)dx+ / j ^(t„ + m„) -X i-§„(a:)da: i-, 



by integration by parts. 

Now replace F„ by the true Fo in the definition of ii,^ to obtain 

Ll, . r ^i^liafc^dFo(.) + il/^l(^±^^dfo(x) 

J m„ hn (x) Jt„ hn{x) 

i(r„ + m„)-.}{^-^}dfo(.)-LO„ 
where 

^2,1 = - J ja; - i(7y„ +m„)|s'o(a;)dx- y" |i(r„ + m„) - x|s'o(x-) dx. 

Next, using a Taylor expansion of order 2 on the function — l/ft.o(a;) and about 

the point m„ , we obtain 

Lli + Lli - - ^")' + °((''" - ^")') (5.5) 

since both /i„ and are consistent by Theorem 3.1, h'^ix) = on (t„,77„) and because 
Tn — Vn = Op(l) by Corollary 3.3. Therefore, by Lemmas 5.1 and 5.2, together with the 
above calculations, we may write 

> + - - T„y - Op(n-4/5) - en~'/'{T^„ - T,,f - Op{n-') 
= Y^{5||^/K) - 192e|(,y„ - r„)4 + oHvn ~ r^t) - 0,{n~^h- 

We choose e sufficiently small (so that the leading term in the last line of the above 
display is positive) and hence conclude that (rjn — t„) = Op(n~^/^). A similar approach 
proves the non-decreasing case. □ 
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Lemma 5.4. Let ^„ he a sequence converging to xq. Then, for any e > 0, there exists 
an M > 1 and a c> such that, with probability greater than 1 — e, we have that there 
exist change points Tn < £,n < Vn of hn such that 



inf \h„{t) ~ ho{t)\ < cn-'^/^ 

i6[r„,j;„] 



for all n sufficiently large. 



Proof. Fix e > 0. From Lemma 5.3, it follows that there exist touchpoints rjn and t„, and 
an M > 1 such that ^„ - Mn'^/^ <Tn< f„ - n^^/^ < ^„ + n'^/^ < < ^„ + A/n^i/^ 
Fix c > and consider the event 

inf \hnit)-ho{t)\>cn-^/^. (5.6) 
First, assume that hn is non-increasing on [r„,?7„]. On this set, we have that 

..hn{t)-ho{tl 

(Vn-t) Sa(t)dt 

hn[t) 

> Bcn^^^''{r]„ - T„f > Ben 



4/5 



where B is some constant depending on xq- Using the definitions in (5.4), as well as the 
equality in condition (2.2) with (2.7), it follows that 

O^iin.iiVn)- I' A„4(t)dt-H„4(T„)+ / A„4(t)dt 



(K.i('rn) - A„._^(r„))(7^„ - r„) 

dF„(i) - (7M - tPn{t) dt 

''"(^„_t)M<h^M)5„(t)dt+ r^t-^d¥n{t)- r{vn-t)Mt)dt, 

where F„(t) = F„(t) — Fo{t) and S„(t) = Sn{t) — So{t). By the assumption on hg and xq, 
and arguments similar to those used for Lemmas 5.1 and 5.2, we can show that 

(Vn-t) So[t)dt^Op{n ' ), 

hn[t) 

which is a contradiction to (5.6) if c is chosen large enough. A similar argument completes 
the proof for the non-decreasing case. □ 

The next proposition is the key to proving tightness in the next section. The results 
follow from the previous lemmas and make extensive use of the underlying convexity. 
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Proposition 5.5. Under the assumptions of this section, we have that, for each M > 0, 
sup \hn{xo + n-^'h) - /lo(a-o) - n-^'hh'^{xa)\ = Oj,(n-2/5), (5.7) 

|i|<A/ 

sup K(xo+n-i/4)-/i[,(xo)| =0p(n-i/5). (5.8) 

\t\<M 

Proof. For M,e > fixed, define rjn^i to be the first point of touch after xq + Mn~'^/^ 
and Tjn^i to be the first point of touch after rjn.i-i +n^^^^ , i = 2, 3. Define the points rin,-i 
for i = 1,2,3 similarly, but working to the left of xq. By Lemma 5.4, there exist points 
Cn,i G iiln,i,iln,i+i), i = 1)2,-2,-3, and a constant c > such that with probability at 
least 1 — e, we have that \hn{(,n,i) — ho[S,n,i)\ < cn^"^!^ . 

As hn is convex, it follows that for any t^\xQ — Mn~^l^ ,xq + Mn~^l^\, 

^ feo(e«,2)-/to(C»4)+2cn-^/5 

</i;,(e„,2) + 2c7i-i/5 

since ^„^2 — 61,1 > n^^^^, where /i^(i) denotes the right derivative at t. Because of the 
continuity of h'^i/) near xq, we may replace hlf^i^^nj) with ft,Q(.To) +cn~^/^ for some new 
constant c. The result follows. A similar argument shows the lower bound. 

We now consider (5.7). By Lemma 5.3, there exists a constant K > M such that 
there exist two touchpoints in [xq + Mn~^/^,xo + Kn~^/^], n''^/^ apart with proba- 
bility 1 — e. The same is the case in the interval [xq — Aln^^^^ ,xo — Kn~^/^]. From 
Lemma 5.4, it follows that there exist points ^„^i € [xq + Mn^^^^^ ,xq + Kn^^^^^] and 

Cn,2 € [xq - Mn^^/^,XQ - Kn~^/^] such that \h„{^n,i) - /io(6i,i)l < cn^"^/^, for i = 1,2, 
with probability at least 1 — £ and sufficiently large n. Lastly, we have already shown 
that there exists a c' such that with probability at least 1 — e, 

sup \K{t)-h'^{xo)\<c'n-^/\ 

Therefore, with probability at least 1 — 3e, we have that for any t^[xQ — AIn^^/^,xo + 
Mn^^/^] and sufficiently large n, 

Kit) > /i„(en,l) +Ki^ns)it - Cns) 

> h^i^ns) - cn-^^' + (h'oixo) - c'n-^/'>){t - i^s) 

= h„{xo) + h'{xo){t - .To) + ^h'\x*o){^r^,i - xof - cn-^/^ - dn-^l\t - ^m) 

> /io(a;o) + h!{x^){t - .tq) - Sn-^/^ 
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for some constant i? > 0. A similar argument proves the other direction. □ 
5.3. Limit distribution theory at a fixed point 

From Lemma 5.3, we know what rescahng is necessary to pick up a meaningful limit. 
The idea of the proof is now to write carefully a local version of the characterization of 
the MLE, Lemma 2.2, and to show that in the limit, these become the characterization 
of the invelope (Definition 1.1). The invelopc !{■) is described in terms of the "driving" 
process Y{-). Our goal will then be to identify the two processes, one which converges to 
the invelope and another which converges to the driving process Y . 
Note that at (where h"{xo) > 0), we have three possibilities: 

1. h'Q{xo) > 0: By continuity, ft.o(x) > in a neighborhood of xq. It follows from the 
consistency of the MLE derivatives that ft.^ > for sufficiently large n and hence 
all touchpoints to be considered are of the "increasing" kind. 

2. HqIxq) < 0: By the same argument, all touchpoints are decreasing. 

3. ft.o(a;o) = 0: Since h{xo) > 0, by Corollary 2.5, there is always at least one touchpoint 
which satisfies both the non-increasing and non-decreasing properties. The limiting 
process may then be "stitched" together in an appropriate manner. 

Therefore, it will be sufficient to prove the asymptotic results for both types of touch- 
points: non-increasing and non-decreasing. For the sake of brevity, we outline the argu- 
ment only for the non-increasing setting. 

For any interval [a,b] C M, let D[a,b] denote the space of cadlag functions from [a,b] 
into M endowed with the Skorohod topology and C[a, b] the space of continuous functions 
endowed with the uniform topology. 

Driving process. 
Define 

B„(i) = V^(H„(t)-ffo(t)), (5.9) 

where H„ is the empirical cumulative hazard function, defined by dM.„{u) = (1 — 
F„(t-))"MF„(t). From [28], Chapter 7, Theorem 7.4.1, page 307, we know that for 
tG{0,To) with To = To{Fo) = mi{x:F{x) = l}, B„(t) ^ B(C(t)) in D[0,M] for M<To, 
where B denotes a standard Brownian motion on [0,cx)) and C{t) ~ FQ{t)/ So{t). Let 
Xn{t) = 2^0 + n~^/^t and define 

Y'°-{t) = n^'^ [ %„(z;)-H„(xo)- / (/io(xo) + (w - a;o)/i[,(a;o)) dw j dz;. (5.10) 

J Xq V J Xo J 

It is not difficult to show that 

Y':%t) ^ VC^)J^ Wis) ds + ^Kixo)t\ 



MLE & LSE of a convex bathtub hazard 1031 



{Y'n'it) ^ ^/m^)W{t) + ^h'i{xo)t' 

in D[~M, M] for each fixed < M < oo, where is a two-sided Brownian raotion process 
starting at and C'{t) ~ ho{t) / So{t) . Next, define 

O0(X0j Jxo JxQ I hn{u) J 

•^Ola^oj Jxo Jxo hn{u) 

where dH,*('u) = dH„(M). Tlie derivative (Yj°<j^)'(t) is not difficuh to calculate. By 

consistency of /i„ and since supj |§,i(i) — 5o(t)| — > a.s., for any M > 0, 

hm sup |Y;-^(i) - Y':^{t)\ ^ hm sup HY^ZYW i^lTYm = a.s. (5.11) 

" |t|<A/ " \t\<M 

YJ^j^ is our driving process. 
Invelope process. 

Recall definitions (5.4). Our initial candidate for the invelope is defined as 

lJoc(^)^^4/5M^ r-'*' r Un(u)-/.o(xo)-(u-..o)fe[,(xo) U ^ ^^^^ 
So{xo) Jx„ Jx„ { hn{u) J 



I xo ■Jxo 



where 



Ki = -n'^^^^AHU^xo) - A,a(^o)} and 

^o\Xo) I Jq 

Notice that because of the presence of S„(t;) in its definition, I„'jJ^(i) is not three times 
differentiable. We therefore define 

l?*,ioc/,N 4/5 ^o(^o) T"^*^ r f ^»(m) - hojxo) - (m - a;o)fe;)(a-o) ) 

'-^^^^^^^5^1 11 M^:^ j5o(.)d.d. 

From Proposition 5.5, we have that for any M > 0, 

hm sup ittW-la'WI-O. (5.12) 

" \t\<M 



limit. The two arguments together prove that Z„, and hence i^'i"^' have the appropriate 
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The derivatives of will describe the limiting behavior of our estimators. First, 

though, we must show that this process converges to the invelopc. To do this, define the 
vector 

z„(i) (YLi(i), (YLi)'(i),i„:f ^(t), (i:t)'{t), Kir{t). kit^^)) (5-13) 

and fix M >Q. We will show that Z„ is tight in the product space 

E[-M,M] = C[-M,M] X D[~M,M] x C[-M,Mf x D[-M,M]. 

This will be done last. Wc first assume that Z„ has a weak limit and identify its unique 
limit. The two argum 
limiting distribution. 

Identifying the limit. 

It is sufficient to show that I*'']^'^ satisfies (1.2)-(1.4) in the limit. 
For condition (1.2), calculate 

with equality at the (non-increasing) touchpoints of /i„, using (2.2). By (5.12), it follows 
that I*'^'^ satisfies (1.2) in the limit. 

Next, the derivatives of I„*]'°'^(i) are calculated as follows: 

(I -'-)'(i) ^n^/^M^ r''' / - - - ^°)^o(^") Xs.ju) du + 1„ 

So[Xo) Jxo I hn{u) J 

^ 5o(.o)l /i„(.„(t)) 1*0(^0 + n t). 

Due to Theorem 3.1 and Proposition 5.5, we have that 

hm sup |(X:f^)''(t)~n2/5[/J„(xo + n-i/5t)-/io(xo)-n-i/5t/,[,(xo)]|=0, (5.15) 
" |t|<j\/ 

where n'^/^\hn{xo + n-^'H) - ho{xo) - n-^/Hh'f^{xo)] is convex, and hence the limit of 
{\*f°'^)"{t) will be convex. Thus, (1.3) is satisfied in the limit. 
Let Bn{t) = {ho{xo) / Soixo)) x {So(t)/hn{t)). We may then write 

(JntTit) = n'/nh'Mt)) Kix„)]BMt)) 

+ n'^^lKixnit)) - hoixo) - n-'^Hh'oixo)] x B'„{x„it))- 
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Notice that sup|j|<^y^ |1 — Bn{xa + n^^/^t)\ ->a.s. 0, with lim„ B[^{xa + n~^^^t) bounded. 
Therefore, from Proposition 5.5, it follows that 

hm sup \(i:l°y'{t)~n'/%'^{xo + n-''h)~h'^{x,)]\^Q, (5.16) 

" \t\<M 

where n^'^ [h'^ {xq + n~^'H) - h'^ (xq)] is piecewise constant, with jumps at the touchpoints 
of hn- By consistency of we have 

d(i::l°T{t) = Bnixo + n-^'h) dff„(0 

+ 2[/i;(xo + n-^/h) - h'^{xQ)]B[-^{xQ + dt 

+ n^'^\hr,{xa + - hoixQ) - n-^'^th'^{xQ)] AB[^{xq + n-^/H) 

= {s„(xo + n-^'H) + o;(ji-2/5)}dg„(i) + o;(n-i/5) d^, 

where g„(t) = n^/^[/i^(2;o + n~^/^t) — ^.[((xo)]. We say that a process X„(t) is 0*(1) if 
sup|t|<ji/l^nWI is Op(l). 

Next, fix a c > 0. Since hn is piecewise linear, it follows that d^„ puts mass only at 
the locations of touchpoints of /i„. However, at these locations, by (5.14), the process 
(0 - Yj^°<j^(t) is equal to zero. It follows that 

/jI„"iW-Y;,-^(i))d5n(i)-0. 

Hence, 

f{ Kt^ - ^n:m dfcf ')"'(^) = /' ( Kt^ ^nW) d[(i:r)"' - 

+ f{ Kt^ -la (i)) d5n(i) - 0,(1), 

using Proposition 5.5, (5.12) and the fact that ^„ is increasing. 

It remains to show that (1.4) is maintained under limits. This follows from the contin- 
uous mapping theorem since for any element z = {zi, Z2, z^, z^, Z5, zg} G E[—M, M], 

'>P{z)= / (Z3-Zi)dz6 
J-AI 

is continuous in z for zg increasing. We have thus shown that II„*i°'^(i) satisfies the 
invelope conditions (1.2)-(1.4) asymptotically. This shows that the only possible limit of 

inf°^{t) is the process I. 
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Tightness. 

We already know that Y^°\{t) and (Yj°'[)'(i) are tight in C[-M,A/] and D[--M,M], 
respectively. To address tightness of the invelope processes, note that bounded and in- 
creasing functions are compact in D[—M,M] and that bounded continuous functions 
with uniformly bounded derivatives are compact in C[—M,M]. These two facts allow us 
to address only stochastic boundedness of {t), i = 0, . . . , 3, to obtain tightness. 

Thus, Proposition 5.5, along with (5.15) and (5.16), says that (j*;l°'')"{t) and (l*;l°y{t) 
are tight. It remains to argue the same for (I„*^°'^)'(0 and ^*\°^{t)- However, this will 

follow by Proposition 5.5, and (5.12), if we can show that both An.i and An^^t + Bn^i 
are tight. 

Let r„ be the largest touchpoint smaller than xq. By (2.7), and after careful calcula- 
tions, we have 



n 



3/5f r° ^o(") " ^o(^o) " ^o(a:o)(M - Xq) ^^^^^^ 

Jr„ hn{u) 



1 



-n3/5 / ,^^d{F„-Fo}(w)+H'/' / §n{u)-So{u)du. 

hn{u) 



xo 



By Proposition 5.5, Lemma 5.3 and Theorem 3.1, the first two terms are tight in 
C[— A'/, Af]. Arguments similar to those used in the proof of Lemma 5.1, along with 
Lemma 5.3, may be used to handle the remaining terms. Since 'Hn,i{Tn) ~ /J"" A„_4_(?;) Av 

by Lemma 2.2. it follows that Bn,i is tight in C[~Af, M], which, in turn, implies that Z„ 
is tight in the space £'[— Af, Af]. 

From the invelope to Theorem 1.2. 

By (5.15) and (5.16), the limiting behavior of rt^/^(/i„(a;o) — /io(a;o)) and n^/^(ft,Jj(xo) — 
^o(^o)) is the same as that of the second and third derivatives of which converge 

to the invelope of lim„ Yj"^^. Define ki, k2 by 

t 2 



limY;°'^(t) = ^C'ixo) / W{s)ds+—h'^{xo)t^ = ki W{s)ds + k2t\ 

" Jo ^4 Jq 

For any a, 6 > 0, bY{at) ^ W{s) As + a^bf^. Therefore, choose a, h so that a'^b = ^2 

and ~ki. It follows that 

Y':i{t)^bY{at). 
Applying this rescaling to all processes shows that 

(I„*f')"(0)=^6a2l"(0) and (I„*f ^)"'(0) ^ &a3l'"(0). 
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It is now straightforward to calculate the correct constants, ci and C2, of Theorem 1.2. 
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